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The full sky cosmic microwave background polarization field can be decomposed into 'electric' 
and 'magnetic' components. Working in harmonic space we construct window functions that allow 
clean separation of the electric and magnetic modes from observations over only a portion of the 
sky. We explicitly demonstrate the method for azimuthally symmetric patches, but also present it in 
a form in principle applicable to arbitrarily-shaped patches. From the window functions we obtain 
variables that allow for robust estimation of the magnetic component without risk of contamination 
from the probably much larger electric signal. The variables have a very simple noise properties, 
and further analysis using them should be no harder than analysing the temperature field. For an 
azimuthally-symmetric patch, such as that obtained from survey missions when the galactic region 
is removed, the exactly-separated variables are fast to compute. We estimate the magnetic signal 
that could be detected by the Planck satellite in the absence of extra-galactic foregrounds. We 
^ , also discuss the sensitivity of future experiments to tensor modes in the presence of a magnetic 

O ' signal generated by weak lensing, and give lossless methods for analysing the electric polarization 

' field in the case that the magnetic component is negligible. A series of appendices review the spin 

weight formalism and give recursion relations for fast computation of the spin-weighted spherical 
harmonics and their inner products over azimuthally-symmetric patches of the sphere. A further 
appendix discusses the statistics of weak signal detection. 
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I. INTRODUCTION 



Observations of fluctuations in the temperature of the cosmic microwave background (CMB) are now providing us 
with a direct view of the primordial inhomogeneities in the universe. The power spectrum of temperature fluctuations 
, yields a wealth of information on the nature of the primordial perturbations, and the values of the cosmological 
rS i' parameters. Mapping the polarization of the cosmic microwave sky is an important next step, offering a great deal 
' [ of complementary information, especially regarding the character of the primordial inhomogeneities Q. One of the 
^ ■ most interesting questions to resolve is whether the primordial perturbations possessed a tensor (gravitational wave) 
] component, as predicted by simple inflationary models. Here, polarization measurements offer a unique probe ^, Q. 
■ Polarization of the cosmic microwave sky is produced by electron scattering, as photons decouple from the primordial 
plasma. Linear polarization is produced when there is a quadrupole component to the flux of photons incident on 
a scattering electron. Scalar (density) perturbations generate an 'electric' (gradient) polarization pattern on the sky 
' due to gradients in the velocity field on the surface of last scattering. For scalar perturbations the velocity field is 
H I curl-free, and this leads directly to the production of an entirely 'electric' pattern of linear polarization. In contrast, 
tensor perturbations (gravitational waves) produce polarization by anisotropic redshifting of the energy of photons 
through decoupling. In this case the polarization has 'magnetic' (i.e. curl) and 'electric' (i.e. gradient) components 
at a comparable level. A magnetic signal can also be produced by weak lensing of the electric polarization generated 
by scalar modes. Detection and analysis of the lensing signal would be interesting in itself, but a detection of an 
additional tensor component would provide strong evidence for the presence of primordial gravitational waves, a 
generic signature of simple inflationary models. 

Detecting or excluding a magnetic component is clearly of fundamental significance in cosmology. But there is 
a significant obstacle to be faced. The problem is that for the foreseeable future, the primordial sky polarization 
will only be observable over the region of the sky which is not contaminated by emission from our galaxy and other 
foreground sources of polarization. Thus we shall only be able to measure the polarization over a fraction of the sky. 
But the electric/magnetic decomposition is inherently non-local^ and non-unique in the presence of boundaries. 
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To understand this, consider the analogous problem of representing a vector field Vl (in two dimensions) as a 
gradient plus a curl: 

V, = V,<i> + V,x, (1) 

the electric and magnetic components respectively. From this equation, one has — V*Vi, and V^x = ^'^^^/^j- 
For a manifold without a boundary, like the full sky, the Laplacian may be inverted up to a constant zero mode, and 
the two contributions to Vi are uniquely determined. But for a finite patch, one can always think of adding charged 
sources for the potentials $ and x outside of the patch on which Vi is measured, which alter <I> and x without changing 
Vi. For example one can add to $ and x pieces with equal but perpendicular gradients so there is no net contribution 
to V. 

Since full sky observations are unrealistic, so is the hope of a unique decomposition of the sky polarization into 
electric and magnetic components. However, this does not at all mean that the hypothesis of a magnetic signal cannot 
be tested. One possibility is to construct a local measure of the magnetic signal by differentiating the measured 
polarization (i.e. Vej-'V^- vanishes if Vi is pure electric in the analogue example above), but this is problematic for 
noisy, sampled data. A more promising alternative, which avoids differentiating the data, is to construct line integrals 
of the polarization For example, in the vector analogy above, any line integral f dx^Vi is guaranteed to vanish if 

Vi is purely electric. However, the problem with these line integrals is that there are an infinite number of them, and 
they are not statistically independent. One would therefore prefer a set of 'magnetic' variables to which the 'electric' 
component does not contribute, but which are finite in number and statistically independent, for a rotationally 
symmetric statistical ensemble. Since polarization from a primordial scale invariant spectrum of gravitational waves 
is predominantly generated on scales of a degree or so (the angle subtended by the horizon at last scattering) , we would 
expect to be able to characterize the cosmic magnetic signal by a set of statistically independent variables roughly 
equal in number to the size of the patch in square degrees. However the signal within a degree or so of the boundary 
cannot be unambiguously characterized as magnetic, and hence one loses a number of variables proportional to the 
length of the boundary. The amount of information about the magnetic signal therefore increases as the patch area 
minus the area of this boundary layer. In this paper we shall find the set of observable 'magnetic' variables explicitly 
for circular sky patches: the method may be generalized to non-circular patches if necessary. 

As mentioned above, the electric component of the polarization (due primarily to scalar perturbations) is expected 
to be much larger than the magnetic signal. Therefore to detect the latter it may be useful to construct observables 
which suffer no electric contamination. We show how to construct such variables, and use them to estimate what 
magnitude of magnetic signal the planned Planck satellite^ might be able to detect. We also discuss the optimal 
survey size for future experiments aimed at detecting tensor modes via magnetic polarization, including the effects of 
'magnetic noise' due to weak lensing of the dominant electric polarization |^. Even for observations that do not 
expect to detect the magnetic signal the magnetic-only observables are likely to be very useful in checking consistency 
of any residual polarization with noise or indeed in identifying foreground contamination. They may also be useful 
for studying the small scale weak lensing signal |^ . 

To construct variables that depend only on the electric or magnetic polarization we integrate the polarization field 
over the observed patch with carefully chosen spin- weight 2 window functions. We present a harmonic-based approach 
for constructing these window functions which is exact in the limit of azimuthally-symmetric patches. The method is 
expected still to perform well for arbitrary shaped patches of the sky, but the separation will no longer be exact in that 
case. Constructing the window functions with our harmonic method automatically removes redundancy due to the 
finite size of the patch, keeps the information loss small (except for very small patches) , and ensures that for idealized 
noise in the polarization map (isotropic and uncorrelated), the noise on the electric and magnetic variables preserves 
these properties. In this respect the construction is analogous to the orthogonalized harmonics approach used in the 
analysis of temperature anisotropics |l^ . However in the polarized case there is no simple interpretation in terms 
of a set of orthogonalized harmonics. 

In Ref. p2[ it was shown how the lossless quadratic estimator technique can be applied to polarization. There, 
no attempt was made to separate the electric and magnetic contribution to the estimators, so the resulting window 
functions for the power displayed considerable leakage between the electric and magnetic modes. The authors of 
Ref. showed how the leakage could be reduced, but it is arguably still too large to allow robust estimation of 
the magnetic signal in the presence of an electric signal that is orders of magnitude larger. We are able to perform 
a much cleaner separation at the level of the harmonic components in the map, and as we shall see the information 
loss in our approach is quite small for full sky surveys with a galactic cut. 
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The electric-magnetic decomposition of the polarization field is exactly analogous to the corresponding decomposi- 
tion of projected galaxy ellipticities induced by weak lensing Ref. [|5| shows how to construct local real-space 
correlation functions for measuring the magnetic component. These are useful for distinguishing the purely electric 
signal due to gravitational lensing from intrinsic correlations in galaxy alignments, and the method has the advantage 
of working for arbitrarily shaped regions of sky. However the method assumed a flat sky approximation, and includes 
only the two-point information. For polarization observations the sky curvature will be important and we aim to 
extract a set of statistically independent observables that contain as much of the magnetic information as possible. 
This may also prove useful for weak lensing studies. 

The paper is arranged as follows. In Sec. || we present the spin-weight 2 window technique for separating electric 
and magnetic polarization on the sphere, generalizing results in Refs. P, ^. Section [II describes our harmonic-based 
technique for constructing window functions with the properties required to ensure separation of the electric and 
magnetic modes while keeping information loss small. Classical techniques for testing the hypothesis that there is no 
magnetic signal are discussed in Sec. IV, and estimates of the detection limits with the Planck satellite and future 
experiments are also given. Lossless methods for estimation of the polarization power spectra are contrasted with 
methods using the separated variables in Sec. In a series of appendices we outline our conventions for spin weight 
functions and their spherical harmonics. In addition we present a number of the standard integral theorems on 2- 
dimensional manifolds in convenient spin weight form, and present recursive methods for the fast computation of the 
spin weight spherical harmonics and their inner products over azimuthally symmetric patches of the sphere. A further 
appendix discusses the statistics of detecting weak signals from tensor modes. 



II. ELECTRIC AND MAGNETIC POLARIZATION 



The observable polarization field is described in terms of the two Stokes' parameters Q and U with respect to a 
particular choice of axes about each direction on the sky. In this paper we take these axes to form a right-handed 
set with the incoming radiation direction (following Ref. Q). The real Stokes' parameters are conveniently combined 
into a single complex field that represents the observed polarization 

P = Q + iU. (2) 

The values of Stokes' parameters depend on the choice of axes; since Q is the difference of the intensity in two 
orthogonal directions it changes sign under a rotation of 90° . The Q field is related to the U field by a rotation of 45° . 
More generally under a right-handed rotation of the axes by an angle a about the incoming direction the complex 
polarization transforms as P — > e~^'"P and is therefore described as having spin minus two (see Appendix ^ for our 
conventions for spin weight functions). The analysis of polarized data is therefore rather more complicated than for 
the temperature which does not depend on making a choice of axes in each direction on the sky. 

As described in Appendix one can define spin raising and lowering operators that can be used to relate quantities 
of different spin |p^ , |l7| . The spin raising operator is denoted 9 and the lowering operator 9 . Since the polarization 
has spin-weight -2 it can be written as the action of two spin lowering operators on a spin zero complex number 

P= 9 g(Ps + zPB). (3) 

The underlying real scalar (spin-zero) fields Pe and Pb describe electric and magnetic polarization respectively [ pT[ . 
They are clearly non-local functions of the Stokes' parameters. One can define a spin zero quantity which is local in 
the polarization by acting with two spin raising operators. Using some results from Appendix ^ one obtains 

9 9P= (V2 + 2)v2(Pb + zPb) (4) 

where V is the covariant derivative on the sphere. The real and imaginary parts of this equation can therefore be 
used to determine the electric and magnetic parts of the polarization. Performing a surface integral we define 

r 2 w = / dS'iy* 9 9P, = / dS-iy* 9 gp*, (5) 



Js JS 



where ly is a complex window function defined over some patch S of the observed portion of the sky. It follows that 

= h(^2,W + ^'-2,w)^ = ~^h{^2.W ~ ^-2,w) (6) 
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provide a measure of the el ectric and magnetic signals. Note that E'y^r, = E'^^ with an equivalent result for B'-^y. 
Using the integral theorem ( A23 ) in Appendix ^ we can write 



= [dSP*2W*+(f idl {W*BP* - P*dW*) , (7) 

Js JdS 

IL2W = [dSP_2W*+(f idl {W*dP- P dW*), (8) 

Js JdS 

where 2W = d dW is now a spin 2 window function, = 9 is a spin —2 window function, and idl — idl * 

is the spin 1 element of length around the boundary dS of S. Clearly we do not wish to take derivatives of noisy 
observed data and hence it is usually useful to choose the window function to eliminate the derivative terms on the 
boundary. 

For CMB polarimetry we are interested in the polarization defined on the spherical sky. The surface integrals 
vanish if we choose W such that B BW — B dW — 0, which will be true if is a linear combination of the spherical 
harmonics with / = or 1, since these possess no spin 2 component. If we then set W = on the boundary, so as to 
eliminate the derivatives of the polarization, we are forced to consider circular patches S, in which case a combination 
of the two rn = harmonics works. This implies that the electric and magnetic signals can be probed by performing 
line integrals around circles, as emphasized in Refs. ^. These line integrals can be performed around any circle 
that is enclosed in the observed region of the sky, and it is unclear how to obtain a complete set of statistically 
independent observables in order to extract all of the available information. Also for current experiments, performing 
one-dimensional line integrals on pixelized maps is unlikely to be a good way to extract information robustly. 

In this paper, we suggest choosing the window functions so that the line integrals around dS that appear in the 
construction of and contain no contribution from the magnetic and electric polarization respectively. In the 
absence of special symmetries (see below for exceptions that arise in the case of circular patches) this requires that W, 
9 W, and 9 W all vanish on the boundary. These conditions are equivalent to demanding that the window function 
and its normal derivative vanish on dS. With such a choice of window we can measure the electric and magnetic 
signals using only the surface integrals 

I±2,w = Ew±iBw= [ dS ±2W*{QTiU). (9) 
Js 

Since the window functions are scalar functions on the sphere we can expand them in spherical harmonics, 

^ = EE i/lifw^-^-- (10) 

l>2 \m\<l V ^ '' 

(The square root factor is included for later convenience.) We need not include / = and 1 spherical harmonics 
since they do not contribute to the spin-weight ±2 window functions, and the boundary integral terms automatically 
separate for these multipoles. In practice, we are only interested in probing scales to some particular /max (e-g- the 
magnetic signal from tensor modes has maximal power for I « 100 and decreases rapidly with /), so the sum in 
Eq. ( |l0| ) can be truncated at some finite Zmax- 

We shall focus on the case where the observed sky patch is azimuthally symmetric in which case the construction 



of exact window functions becomes particularly simple. The harmonic-based method we describe in Sec. Ill provides 
a practical solution to constructing a non-redundant set of window functions that separate the electric and magnetic 
modes exactly. In addition, for the special case of isotropic, uncorrelated noise on the observed polarization, these 
simple properties are preserved in the variables E\y and B\y. For observations over non- azimuthally symmetric 
patches our method can, of course, be used over the largest inscribed circular patch, but in this case there is inevitable 
information loss since we use only a subset of the observed data. However, we expect that the method presented 



in Sec. Ill could also be applied directly to the full observed region to construct window functions that achieve 
approximate separation of electric and magnetic polarization. 

Consider the case of an azimuthally-symmetric patch so the boundary dS consists of one or two small circles. For 
each azimuthal dependence on m we can construct combinations 

fl \ jr^i^Wl^Yim (11) 




that satisfy the necessary boundary conditions. For m = it is easy to see that Eyir and B\y contain no contribution 
from Pb and Pe respectively for any choice of the Wio [i.e. the boundary integrals that distinguish Ew (Bw) from 
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E[y (B^y) vanish if the polarization is pure magnetic (electric)]. It follows that for m = there are Zmax — 1 linearly 
independent window functions that satisfy the boundary conditions. For \m\ = 1 it will be shown in the next section 
that there is only one independent linear constraint per boundary circle, so there are ^max — 2 possible window functions 
(^max — 3 for a boundary composed of two circles). For |m| > 2 there are two linear constraints per boundary circle 
which can be taken to be the vanishing of Wm and its normal derivative. In this case there are Imax — |w| — 1 
(^max — {iTil — 3) window functions for boundaries consisting of one (two) small circles. 

Since we are only considering a fraction of the sky not all of the window functions counted above may return 
observables Ew and Byy containing independent information. This arises because for large /max, or small patches, 
there will generally arise non-zero window coefficients Wim that produce spin 2 window functions that are poorly 
supported over the patch. (See e.g. Ref. for a discussion of the equivalent problem in the case of scalar functions.) 
The redundancy in the set of acceptable window functions can be removed by expanding the spin 2 window functions 
in a smaller set of functions which are (almost) complete for band-limited signals over the region S. The construction 
of such a set by singular value methods (e.g. Rcfs. |l8|) forms the starting point of the method we present in the 
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A. Harmonic expansion 

We construct window functions in harmonic space, so as a useful preliminary we consider the harmonic expansion 
of spin- weight 2 fields over the full sphere |^. The polarization P = Q + iU is spin —2 and can be expanded over 
the whole sky in terms of the spin two harmonics (see Appendix |^ for our conventions and some useful results) 

Q±iU = ^ a^2,l7n T2^m = ^{Elm T iBlm)^2Ylm- (12) 
Im Irn 

Reality of Q and U requires a*_2 = (— l)™a2,z(-m)j so that — (— l)™£^;(_m) with an equivalent result for B/m- 
Under parity transformations Eim {-lYEim but Bim (-l)'+^i3;,„, since s>lm(7r -0,4> + tt) = {-lYsYimiO, cj)). 
From the orthogonality of the spherical harmonics over the full sphere it follows that 

Ei^ = \{a2,Vm + a.2M)^h I ASP-2Yil, + \ j dSPW;^, (13) 

J Ait J in 

iBim = Ua2,im-a-2,irn)^~h f d5P_2>^z:„ + ^ / dSP*2Y:^. (14) 



2\'-^^^im ^—z.imj — 2 1 —^-^Lrn ' 2 

Jin J A-K 



In a rotationally-invariant ensemble, the expectation values of the harmonic coefficients define the electric and 
magnetic polarization power spectra: 

{^I'm'^lm) — Sl'lSm'mCf'^ , {B*, ^, Bim) — ^I'l^m'mC^^ . (15) 

If the ensemble is parity-symmetric the cross term is zero, {El,^,Bim) = 0. 

The form of the harmonic expansion (nQ) of the window function ensures that the spin-weight ±2 windows are 



±2W^ = ^W^im±2yim, (16) 
hn 

where the sum is over I > 2 and \m\ < I. Evaluating the surface integrals in Eq. (|^) we find 

Ew = ^ Wil^Elrn, Bw = ^ Wil^Blm, (17) 

Irn Im 

where the pseudo-harmonics are obtained by restricting the integrals in Eqs. ( p^ and (|lj) to the region S: 

Elm = \ I K^i'™' - ^Bv^,)^2Yv^^^2Yil, + {Ev,n' + iBv m')2Yl' m' 2Y1U , (18) 

I'm' •'^ 

Blm ^ [ '^^ [(^''"' " ^Bi,rr.')-2Yl'm'-2Yi*rn - (El'm' + iBv rn')2Yl' m' 2YCJ . (19) 

I'm' 

Defining Hermitian coupling matrices 

W±(i,n)(lm)' = 5(2l^(;m)(;m)' ± -2 W^(;m) (/m)' ) , (20) 
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FIG. 1: The window functions H^+{;'m){im) (solid lines) and W-(^iim.){im) (dashed lines) for I' = 100 and various m for an 
azimuthally symmetric patch with 9 < 10°. The dashed lines show the Eim contamination of Bii^ as a function of I. For m — 
there is no contamination, and as m increases the functions decrease in amplitude as the corresponding harmonics become 
more localized outside of the patch. 



where 

sW(^ljn){lm)' = / dS sYi^^sYl'm', (21) 

Js 

we can write 

Elm = ^ (W^+(/m)(/m)'£^/'m' + «M^-(/m)(/m)'-B/'m') , (22) 
I'm' 

Blm = {W+(im){lm)'Bl'm' — 'iW^(im){lm)'El'm') ■ (23) 

I'm' 

In the hmit ^max ~^ oo, ±2W^(im)(im)' become projection operators as a consequence of the completeness of the spin- 
weight harmonics. The matrix W^(^im){im)' controls the contamination of Ew and Bw with magnetic and electric 
polarization respectively. Our aim is to construct window functions Wim that remove this contamination for all Eim 
and Blm- Some elements of the matrices W±i^im){im)' are shown in Fig. |l]. 

For azimuthally-symmetric patches the coupling matrices are block diagonal (W±(^im){im)' dmm'), and so window 
functions can be constructed for each m separately [see Eq. (^l])]. For m = we have 

f dS2Yi,o2Yi*o^ f dS-2Yi>o-2Yil (24) 
Js Js 
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soW. 



(;o)(i'o) 



and we have clean separation for any azimuthally-symmetric window function. The set of azimuthally 



symmetric window functions gives /max~ 1 separated variables that contain the same information as would be obtained 
by computing line integrals around all those circles concentric with the boundary of the azimuthal patch. For general 
m there is leakage of Eim into Bim', for parity-symmetric cuts there is only leakage between modes with different 
parity (i.e. for even I the pseudo-harmonics Bim depend on Eiim only for odd I'). 

We showed in the previous section that, for a general window function, the contamination of e.g. Ew by the 
magnetic polarization is due entirely to boundary terms. This implies that can always be written as a 

line integral around the boundary of S. (We show in Appendix |Q that the matrices sY{im){im)' can be transformed 
into line integrals for I ^ I'. However W^_(;m)(;m)' can be written as a line integral for all I and /'.) Making use of 
Eq. (|C9|), it is straightforward to show that 



{lm){lm)' 




(25) 



This can be put in manifestly Hermitian form using the recursion relation derived from the action of ( 9 — 3 ) on sYi 
m + s cos 6 



smt 



sYlrn = s){l + S+l)s+lYl,n + ^ ^/ (l + s){l - S + I) s-lYlr, 



For a circular boundary at constant latitude 6 — Q (i.e. the boundary of an azimuthal patch), we find 

W.{im){i',n) = -4nnT[ui{m)ui,{m) + vi{m)vl,{m)], 



where the vectors 




[9ei^/„(e,$)-cotey,™(e,$)], 



(/- 2)! v^m^ 



-il™(e,$) 



(26) 
(27) 

(28) 
(29) 



for / > 2 and some arbitrary (f>. [Note that ui{in) and vi{m) will not generally be orthogonal so Eq. (|27|) is not the 
spectral decomposition of W-(j^rn)(i'm)-\ Any window Wim whose inner products with ui{m) and vi{m) both vanish, 
i.e. 



0, 



(30) 



will achieve clean separation of electric and magnetic polarization. For \m\ > 1 such window functions and their normal 
derivative necessarily vanish on the boundary. As noted earlier, for |m| = 1 there is actually only one constraint to 
be satisfied which now follows from the fact that vi{ztl) = 0. 



III. ELECTRIC/MAGNETIC SEPARATION ON THE CUT SKY 

In this section we give a practical method for constructing a non-redundant set of window functions {W/} where 
I labels the particular window, that achieve exact separation for azimuthal patches. The corresponding (cleanly 
separated) electric and magnetic observables will be denoted Ewj and Bwi- We will make use of a notation where 
vectors are denoted by bold Roman font, e.g. B^y has components -B^/j, and B has components -B/m, and matrices 
are denoted by bold italic font, e.g. W± have components W±(^ijn)(^ijny . We present the method in a form that is 
applicable (though no longer exact) to arbitrary shaped regions S; for azimuthal patches the method is exact. For the 
azimuthal case all matrices are block diagonal and the window functions can be constructed for each m separately. 

In matrix form, Eq. ([l7| ) is 



(31) 
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FIG. 2: The distribution of eigenvalues of W+ for two azimuthally-symmetric sky cuts with Zmax = {250, 1000}. The distribution 
is approximately bimodal, and the fraction of the eigenvalues corresponding to weU-determined modes (eigenvalue significantly 
non-zero) is given by the fraction of the sky area in the patch in the limit Zmax oo. 



where W — Wi(^im) is the matrix whose Ith row contains the harmonic coefficients of the /th window function, and 
recall 

E^W+Il + iW-B, B^W+B-iWH,. (32) 

For an azimuthally-symmetric sky patch the block-diagonal matrices ±2^^ (with components ±2W^(im)(i'm)'5mrni) 
from which W± are constructed can be computed very quickly using the recursion relations given in Appendix y. 



Alternatively, W_ can be computed directly from Eq. (E7 



). In the limit of full sky coverage W_ and W+ 



We know that for |rn| > 2, the range of mth submatrix of W is two-dimensional [spanned by ui{m) and vi{m)\^ 
so that all but two of the eigenvalue of the submatrix are zero. Equivalently, all but two linear combinations of the 
Bim are independent of Ei>rn- The \m\ = 1 submatrices of W- have only one non-zero eigenvalue; the associated 
eigenvectors are u;(±l). The m = submatrix is identically zero. The essence of our method for constructing the 
window functions is to choose W to project out of the range of 

We first diagonalize W+ = U+D+Ul, by performing a singular value decomposition Q. Here, D+ is a positive 
(semi-) definite diagonal matrix whose elements are the eigenvalues of W+. The columns of the unitary matrix C/+ are 
the normalized eigenvectors of W+. The singular value decomposition allows us to identify the linear combinations 
C/| B that are poorly determined by B — those corresponding to the small diagonal elements of The eigenvectors 
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with very small eigenvalues correspond to polarization patterns that essentially have no support inside the observed 
patch of the sky, and would lead to a set of redundant window functions if not removed from the analysis. The 
distribution of eigenvalues of W+ is approximately bimodal as illustrated in Fig. ^, and the exact definition of 'small' 
is not critical when considering the range of the matrix. This bimodality arises because ±2W are approximately 
projection operators for large Imax, and the fact that the range of WL is a rather small subspace. To remove 
redundant degrees of freedom from the spin 2 window functions, we define an operator U+ which projects onto the 
eigenvectors of W+ whose eigenvalues are close to one. This amounts to removing the appropriate columns of U+. 
Since U+ is orthogonal, C7+ is column orthogonal and hence U'^U+ = I (but 7^ I). The matrix £)+ is the 

corresponding smaller square diagonal matrix, and we have 

C7|b « b+ill^ - iillw^^. (33) 

We now multiply by £>^^^^, defined by [D^^^'^]ij = (5^ [£)+],^"^^^, to give 

£>+^/^C/|B w £>+/^C/|B - i£);^/^C/|W_E. (34) 

Including the factor U\_ in the window functions W is equivalent to constructing the spin 2 window functions 

from a reduced basis that is orthonormal and (almost) complete over the region S. 

The remaining step is to project out the unwanted boundary terms that contaminate B with E. For the case of an 
azimuthal patch, the ranges of the submatrices of W- are all two dimensional (or lower). It follows that the same 

~ 1 /2 ~ t 

is true of D;"'"J7|W_, so we can remove the unwanted boundary term by ensuring that W also includes a factor 
that projects out of the range of ibj_^^'^Xj]^W-. In practice, we perform this projection by constructing its singular 
value decomposition, which for a non-square matrix takes the form Z)^^^^i/^W_ — UDV^ . Here, £) is a diagonal 
matrix with the same dimension as D+, (7 is a unitary matrix of the same dimension, and V is a column orthogonal 
rectangular matrix. There are at most two non-zero singular values (elements of the diagonal matrix D) per m, and 
the corresponding left singular vectors (columns of U) form an orthonormal basis for the range 
We can project out of this range by defining U as the matrix obtained by removing the columns of U where the 
corresponding singular value is non-zero. Thus, choosing the window functions 

W* = U^D^^^^Ul, (35) 

we guarantee separation of the electric and magnetic polarization for azimuthally-symmetric patches. Our separated 
polarization observables become 

Bw = W*B = U^D^^^^uIb^U''d]^^uIb, (36) 

Ew = W*E = U^D^^^^uIe,^U^D]^^uIe. (37) 

For azimuthal patches the separation is exact; the approximation sign arises only from our use of w+ w ir+D+ul 
in simplifying the matrix that premultiplies E and B in Eqs. ( p6| ) and (|37|). 

For observations over non-azimuthal patches, one can either apply the exact separation over the largest inscribed 
azimuthal region or attempt to apply the method outlined above to the entire patch. In the latter case the structure 
of W- is less clear, but we can still expect a significant number of its eigenvalues to be very small; the associated 
eigenvectors correspond to window functions that satisfy W ~ ~ (5W on the boundary. The number of independent 
window functions that achieve separation can be estimated as the number of pixels of linear size ~ Tr/Zmax contained 
in the patch (roughly the number of modes that survive the diagonalization of W+) minus twice the number of pixels 
on the boundary (roughly the number of constraints in setting W and 3 to zero on the boundary). Such window 
functions will only give an approximate separation of electric and magnetic polarization. In practice, the accuracy of 
the separation, and the number of independent window functions constructed, will depend on the choice of threshold 

~ —1/2 ~ ^ 

for retaining the singular values of U^W^. 

If we are only interested in constructing variables that depend on the magnetic polarization, the maximum multipole 
^max in the window functions can be chosen rather small (of the order of a few hundred). The relation B = Tl^+B — 
iW-E will only hold for square W- if Zmax is chosen to include all the significant power in the electric polarization, so 
for smaller Imax we cannot assume that W_ is square. However, for azimuthal patches, the range of each submatrix 
of W is still guaranteed to be of dimension two or less, irrespective of its shape, and so the exact separation can 
proceed with W treated as square. For non-azimuthal patches it would be prudent to monitor the effect of varying 
the number of columns in W on the range of this matrix. 
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A. Noise 

It is straightforward to project the errors on the Stokes' parameters to find the noise in the separated variables Ew 
and B^. In this section we consider the simple case of maps with idealized noise properties. 

We assume that the noise correlation between pixels is negligible, and that the noise on the Stokes' parameters 
AQ and AU is un-correlated. The neglect of noise correlations between pixels amounts to assuming white noise in 
the time stream of the measurement. In principle, correlations between Stokes' parameters can be eliminated with a 
careful choice of polarimeter directions in the experiment [ p^ pO[ . With these assumptions, we have 

{AQ{n)AU{n')) = 0, (38) 

and the noise correlation is given by 

{AQ{n)AQ{n')) = a^gSin ~ n'), {Au{n)Au{n')) = al5{n ~ n'). (39) 

It follows that the noise AB on the pseudo-multipoles B has correlations 
(ABABt> = i / d5(a^ + 4)i[_2Y*(_2Y*)t+2Y*(2Y*)t]-i / AS {^l ~ <jI)\UY* {^Y*)^ + ^Y* U^^^ 

JS JS 

(40) 

with a similar result for the noise AE on E. Here, vectors of spin- weight s functions have components aYim- The 
polarimeter arrangements that give uncorrelated errors between Stokes' parameters also ensure that alj = a'q = aj^ 
so that the last integral in Eq. (^) is zero. Here we concentrate on the simple case where a% is isotropic in which 
case 

(ABABt) = (AEAEt) a%W+. (41) 

The covariance of the noise ABvi/ on Bvi/ is therefore given by 

AT = (ABi^ABI^) = W*{ABAB''){W*y « a%tj'< D~^^^UlU+D+UlU+D~^^^U = all, (42) 

and hence the noise is diagonal (and isotropic); similarly (AEvkAE]^) = (J%I. What is more, the noise on Ew and 
BvF are uncorrelated for isotropic noise since 

( AEABt) ^icT^^W-, (43) 

and hence 

(AEvvABI^,) = lalU^ D^^^^ulW-U+D^^^^U = (44) 

as annihilates D^^^'^U^^W^. For isotropic noise our polarization variables therefore have the same desirable 
diagonal properties as the scalar diagonalized harmonic coefficients used in the analysis of the cut-sky CMB temper- 
ature |ll|. 

In the presence of a symmetric beam, white noise in the time stream of the experiment projects to white (though 
generally non-isotropic) noise on the beam-convolved polarization field. In multipole space, the convolved fields 
have electric and magnetic multipoles that are related to the unconvolved Eim and Bim by spin-weight 2 beam 
window functions 2W1 pl| , |2^ . For Imax of a few hundred, appropriate for probing magnetic polarization, the beam 
window functions will be negligible for experiments with resolution much better than one degree. For lower resolution 
experiments, or higher ^max, the effect of the beam window function should be included in the theoretical (signal) 
covariance of the variables E^y and Bi^ (see below). 

In general, non- uniform coverage of the sky will lead to variations in afj. In this case it is still possible to 
define harmonic variables B^^^ = AT^^/^Bvy that have isotropic noise. (Here AT-i/^ = UnD^^^'^uI where AT = 
UnDnUJ^, Un is unitary and Djq is diagonal.) For azimuthal patches with general (everywhere finite) noise patterns 
we can still construct the magnetic-only variables for each value of m but the noise will now couple variables with 
different m (unlike the signal). While presenting no fundamental obstacles, this m-mode coupling does increase the 
computational overhead considerably. 

Sky patches of general shape are equivalent to azimuthal patches with regions of infinite noise, and the general case 
of a non-azimuthally symmetric survey region with arbitrary noise can be treated by re-defining the pseudo-harmonics 
E, B and coupling matrices W+, W- with a factor of l/cr^ inside the integral. All the above results then follow with 
the new definitions, though computing the matrices and manipulating them may become computationally challenging. 
The assumption of isotropic noise is therefore not fundamental to our analysis, and the following results could be 
generalized for more realistic situations. 
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FIG. 3: The real space window functions for an azimuthally symmetric sky patch with 6 < 10°. They are evaluated in the 
frame where the signal is diagonal, so the leftmost window produces the largest signal for that m. The signal to noise falls off 
as a function of m as shown in Fig. ^, For m > the window functions correspond to the real part of B[^'; the imaginary part 
is a rotated version of the same window. The length of the lines shows the sampling weight at that point, and the orientation 
of the lines shows which polarization orientation gives maximal contribution. 



B. Real space window functions 



The expected magnetic signal correlation matrix is given by 



(BwB 



BB 



(45) 

Since the noise 



where the diagonal magnetic power spectrum matrix is given by [C — 

correlation is proportional to the identity matrix for isotropic noise we can perform any rotation B^y '^w' = -RBw, 
where R is unitary, and stiU have a set of variables with uncorrelated errors. The rotated variables are derived from 
window functions R*W. For a particular theoretical model we can rotate to the frame where the signal matrix is 
diagonal. The rotated Bj^-* will then be fully statistically independent. In Fig. ||we plot the window functions for the 
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which give the largest contributions to the signal for a typical flat ACDM model with a scale invariant tensor 



initial power spectrum and no reionization. The window functions are plotted as line segments of length ^yQ 
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FIG. 4: The eigenvalues oi N-'^^^SN^'^^'-^ at a given m with the tensor amphtude chosen to give a detection at 99% confidence 
with probabihty 0.5 (see Sec. |[v| ). The noise is assumed isotropic and the model has reionization aX z — 6.5. For the small sky 
patch most of the signal is in the lowest few eigenmodes of each m, but for larger patches a greater number of eigenmodes are 
required to encompass all the signal (in the bottom plot there are lots of contributions on top of each other along the bottom 
line). For the large patch there are a small number of high signal to noise modes due to the extra large scale reionization signal. 



at angle tan ^{JJw IQw^l^ to the 9 direction where the real quantities Qw and Uw are defined in terms of the real 
part WM'^^^ of the (rotated) scalar window function VF(^) as 

Qw + iUw = -i'^^^W^^\ (46) 

This definition ensures that 

^^^^^ = I AS{QwQ + VwV)- (47) 

JS 

For the imaginary part of b[^\ Qw and Uw should be defined as Qw + iVw = -i3 931^(^'*. For the case of 
azimuthal patches, as considered in Fig. |^ where the windows are constructed for each m, the imaginary part would 
produce a plot that is rigidly rotated by — 7r/(2m) (m ^ 0) about the centre. Plotting the window functions in this 
form is useful since the length of the line segment gives the sampling weight assigned to that point, and the orientation 
gives the direction of the linear polarization that contributes at each point. We could repeat the exercise for the Ew 
in which case for the real part we would define Qw + iUw =33 WM'^^^ . 

In Fig. U we show the signal to noise in the magnetic variables for two azimuthal patches. As the patch size increases 
the signal in the modes with large m also increases, reflecting the fact that for small patches the diagonalization of 
W+ removes a greater relative fraction of the modes at each m as m increases. For small patches of the sky most of 
the signal at each m is compressed into a small number of modes, whereas for larger patches the signal is distributed 
more uniformly. For cosmological models with reionization the signal for large patches is distributed less uniformly, 
with a small number of modes giving big contributions due to the greater large scale power. 



IV. MEASURING THE MAGNETIC SIGNAL 



We are now in a position to use the magnetic observable Bvi/ to constrain the magnetic signal without having to 
worry about contamination with the much larger electric signal. The simplest thing to do would be to test the null 
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hypothesis that the magnetic signal is due entirely to noise (this hypothesis is unlikely to be ruled out pre-Planck) . If 
the signal were not consistent with noise it could indicate various things: the presence of CMB magnetic polarization, 
the presence of polarized foregrounds, that haven't been removed successfully, systematic leakage into the magnetic 
mode in the analysis (e.g. due to unaccounted for pointing errors, or pixelization effects), or Q-U leakage in the 
observation (e.g. due to unaccounted for cross-polarization in the instrument optics). 

Magnetic polarization can originate from tensor modes, but also by weak lensing of the scalar electric polariza- 
tion 123]. The lensing signal should be dominant on small scales, and the magnetic variables could certainly be used 
to observe this signal. Of more interest here is the larger scale contribution from tensor modes. In order to identify 
this component we shall have to model the lensing contribution, which becomes increasingly important as one tries to 
observe smaller tensor contributions . In the first three of the following subsections we assume that the magnetic 
signal is generated purely from the tensor modes, then in subsection IV D we show how our results can be adapted to 
account for the lensing signal. 



A. Is it just noise? 



If the noise and signal are Gaussian the B^i/ will be Gaussian and the simplest thing to do is a test by computing 
— B|y7V~^Bvi/ (for isotropic noise this is just = 'B\^'Bw / ■ Whilst the CMB magnetic polarization signal 
is from tensor modes is expected to be Gaussian, the lensing signal and any spurious or unexpected signal may not 
be. One may therefore also wish to do a more sophisticated set of statistical tests at this point. 

Assuming that the signal is as expected — any B signal present is Gaussian and would have a power spectrum 
as predicted for a near scale-invariant tensor initial power spectrum — one can account for the expected form of 
the power spectrum and thereby increase the chance of a detection. We assume that the main parameters of the 
universe are well determined by the time magnetic polarization comes to be observed, so the shape of the magnetic 
polarization power spectrum is known to reasonable approximation (the only significant freedom arising from the 
shape of the primordial tensor power spectrum). We compute the expected signal correlation S for some particular 
tensor amplitude and say that the real signal is rS. Assuming Gaussian signal and noise the likelihood in this case is 
then given by 

I A exphiBt,,(iV + r5)-iBvy] 

The likelihood distribution can be computed numerically from the Bvi/ observed, and gives the posterior probability 
distribution on the value of r after multiplying by the prior /(r). 

The magnetic signal is expected to be weak, and the detailed statistics for analysing such a signal are given in 
Appendix O. There we show that 

^\^N-^SN-^^w-tr{N-^S) ^^^^ 



gives a measure of the number of 'sigmas' of the detection — the number of standard deviations of the maximum 
likelihood f from pure noise (r — 0) assuming low signal to noise. We use this as a test statistic in Monte-Carlo 
simulations to compute detection probabilities at a given significance. We have checked at isolated points that using 
optimal statistics gains very little except for very small sky patches (where there are only a small number of magnetic 
modes, each of which must have fairly high signal to noise in order to get a detection). 

Using the B^/ variables is clearly not optimal as we have thrown away some well determined linear combinations 
of E and B. However in the idealized situation considered here they should provide a robust way for testing for 
magnetic polarization. The number of modes thrown away is in any case quite small — not more than two per m 
mode for azimuthal patches. We quantify this information loss further in Sec. 0. 



B. Detection by Planck? 



Of the current funded experiments, only Planck is likely to detect magnetic polarization if the levels are as predicted 
by standard cosmological models. As a toy model we consider the 143 and 217 GHz polarized channels of the Planck 
High Frequency Instrument. We approximate the noise as isotropic and ignore the variation of beam width (7.1 
and 5.0 arcmin full width at half maximum respectively) between these channels. Combining maps from these two 
channels with inverse variance weighting, we find crjv ~ 6 x 10~'^/^K/K, where Q and U are expressed as dimensionless 
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FIG. 5: The probability of detecting magnetic polarization at 99 per cent (thick lines) and 95 per cent (thin lines) confidence as 
a function of the tensor initial power spectrum amplitude At for the model of Planck observations described in the text. The 
dotted line is for a model with reionization at z — 6.5, using the unprojected variables and treating the electric contamination 
as part of the noise (see Sec. 0). The other curves are using the projected magnetic variables for models with no reionization 
(solid) and reionization at z = 6.5 (dashed). The vertical dashed line show the tensor amplitude that would contribute about 
1/10 of the COBE signal 



thermodynamic equivalent temperatures in units of the CMB temperature. We apply an azimuthally-symmetric 
galactic cut of 20 degrees either side of the equator. The expected magnetic polarization power spectrum peaks at 
I ~ 100, and there is therefore no need to consider high resolutions so we can use Zmax = 250 without significant loss 
of power. In Fig. ^ we show the probability of obtaining a detection with Planck as a function of the true underlying 
scale- invariant tensor power spectrum amplitude At (defined as in Rcf . ) assuming a standard fiat ACDM model. 

A tensor amplitude of At « 4 x lO^^'' would contribute about 1/10 of the large scale temperature C; detected by 
COBE, and is likely to be detected by Planck if our model is at all realistic. This corresponds to being able to detect 
the signal from infiationary models with energy scale at horizon crossing 

> 2 X 10^6 GeV. Such models include 

the simple 0" potentials, with n > 2. 



C. Survey size 



For a given detector sensitivity the magnitude of the signal that can be detected depends on the size of the sky 
patch that is observed. The signal to noise in each observable increases in proportion to the observation time per 
unit area. The noise covariance is proportional to cr^ which varies in proportion to the observed area for a given 
survey duration. For large areas the number of observables varies approximately in proportion to the area, which 
would make the number of 'sigmas' of a chi-squared detection scale with the square root of the area. Combining these 
two effects, the expected detection is therefore proportional to one over the square root of the area, and is larger if a 
fixed survey time is spent observing a smaller area. However for smaller areas the signal to noise on each observable 
becomes larger, and the number of variables decreases. With fewer variables the probability of obtaining no detection 
increases significantly. This is just the fact that if you observe a small patch of sky you have a larger chance of being 
unlucky and having a patch which has a small magnetic polarization signal everywhere. Also the existence of the 
boundary becomes increasingly important for small patches and a larger fraction of the information is lost in order 
to obtain clean separation of the magnetic observables. 

The question of 'optimal' survey size is somewhat delicate, as it depends on the probability distribution for the 
detection significance that one thinks is optimal. In Fig. ^ we plot the probability of detecting various tensor amplitudes 
at 95 per cent and 99 per cent confidence for different survey sizes. In Fig. |7| we show the minimum gravitational 
wave (tensor) amplitude that might be detected at 99 per cent confidence as a function of the radius of the survey 
size. It is clear that radii in the range 5°-9° are optimal, though one cannot be more precise without defining more 



15 




radius of observed area (degrees) 

FIG. 6: The probability of being able to rule out the null hypothesis at 95 per cent (dashed lines) or 99 per cent (solid lines) 
confidence for scale invariant tensor amplitudes (bottom to top) of At = {1,2,4,8, 16} x 10~^^, for a survey with detector 
sensitivity s = 10 fiKy'^ec that runs for one year and maps a circular patch of sky of a given radius assuming uniform noise, no 
reionization, and no lensing. The probabilities were computed by Monte-Carlo simulation. 



specifically the aims of the observation. A radius of about 7° would be a good compromise between being able to 
place good upper limits if there was no detection (which favours radii closer to 9°) and having a better chance of 
detecting small amplitudes (which favours smaller radii). 

The solid curves in Fig. |^ fully take account of the need to separate the magnetic signal from the (much larger) 
electric signal. By way of comparison, the dashed curves show the minimum detectable amplitude obtainable if one 
could do perfect lossless separation, which is clearly impossible on an incomplete sky (see Sec. With lossless 
separation, the best upper bounds are obtained for smaller patches since the size of the boundary is no longer 
important. The dashed curves in Fig. |^ can be compared with those given in Rcf. ||2^ where perfect separation was 
assumed, the effects of finite sky coverage were treated only approximately, and a less rigorous approach to hypothesis 
testing was employed. Ref. |^ gives an improved analysis along the lines of Ref. and also performs calculations 
properly taking account of the mixing of electric and magnetic polarization through a (brute-force) Fisher analysis in 
pixel space. 

D. Lensing 

Unlike most of the foreground signals that might contaminate the observation, the magnetic signal from the lensing 
of scalar electric polarization has the same frequency spectrum as the primordial magnetic signal and so cannot be 
removed easily by use of multi-frequency observations. In order to isolate the tensor contribution to the magnetic 
signal we can incorporate knowledge of the expected lensing power spectrum into the null-hypothesis covariance 
matrix TV (we neglect the non-Gaussianity of the lensed polarization). For the multipoles / < 250 of interest for the 
tensor signal the lensing signal is approximately white, with Cj^,^ « 4.4 x 10~^/iK^ if the COBE signal is entirely 
generated by scalar modes. For large patches of sky, where the matrix D-^. is nearly proportional to the identity 
matrix, the lensing signal contributes like an additional isotropic noise with crj^i^^g ~ ^i^ns- have checked this 
approximation by computing the following results exactly in particular cases, with agreement to within a few percent 
for patch sizes with 10° < 6* < 80°. 

The effect of the lensing is therefore simply to increase the effective noise by a constant amount. For the Planck 
satellite the effect is small, reducing the At that could be observed by about 1.5 per cent. However for the smaller 
surveys with better sensitivity, considered in Figs. ^ and the effect is much more important. For a one year survey 
of radius 6 with sensitivity s = 10 /iK^/sec the noise variance is given by f{9)a'^ where f{6) = (1 — cos0)/2 is the 
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FIG. 7: The smallest gravitational wave amplitude At (defined as in Ref. j24|) that could be detected at 99 per cent confidence 
with probability (bottom to top) of {0.5,0.8,0.95,0.99} by an experiment with detector sensitivity s = 10 /iKy^sec that runs for 
one year and maps a circular patch of sky of a given radius assuming uniform noise, no lensing, with reionization at 2; = 6.5 
(right) and without reionization (left). The result scales with the square of the detector sensitivity and inversely with the 
duration of the experiment. The dotted lines show the equivalent result if one could perform perfect lossless separation of the 
magnetic modes. 



fraction of the sky which is observed and ctq w 4 x 10^^/iK^. This noise gives the tensor amplitudes At{0) plotted in 
Fig. ^. Incorporating the lensing effect the actual tensor amplitude one could detect in an experiment with sensitivity 
s and duration T is 



'JV.lcns 



(50) 



where At{0) is the amplitude for a one year mission with s — 10 /iK-^sec and lensing ignored (i.e. the amplitude 
plotted in Fig. |^). This allows our previous results to be modified for inclusion of the lensing signal. We have 
plotted the modified results for various survey sensitivities in Fig. ^. The optimal survey size now depends on the 
sensitivity — as sensitivity improves the lensing signal becomes more important and one needs to survey larger scales in 
order to accurately measure the difference in variance expected with the tensor signal. For large patch sizes the tensor 
amplitude At that can be detected in the absence of lensing is proportional to y/f{6). Allowing for lensing, there is 
an optimal survey size at ^tts^ f{9)/T = a]^ ^^^^ [if there is a solution with f{0) < 1], in other words when the variance 
of the instrument noise is equal to the lensing signal. There is a lower limit of At « 4 x 10~^^ that can be measured 
even with perfect sensitivity, when the tensor contribution cannot be distinguished from random sampling variations 
in the lensing signal distribution. This corresponds to an inflation model with energy scale V^^^ « 7 x 10^^ GeV, in 
broad agreement with Ref. |^ for a three sigma detection. This situation could only be improved if one could find 
ways to obtain information about the particular realization of the lensed signal. 
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10 100 
radius of observed area (degrees) 

FIG. 8: The smallest gravitational wave amplitude At that could be detected at 99 per cent confidence with prob- 
ability 0.95 (solid lines) and 0.5 (dashed lines) by an experiment with detector sensitivity (bottom to top) = 
{0, 25, 50, 100, 200, 400} fjK^sec that runs for year and maps a circular patch of sky of a given radius assuming uniform noise, 
a white lensing power spectrum, and a ACDM cosmology that reionizes at z — 6.5. The horizontal dotted line shows the 
amplitude which contributes about 1/10 of the COBE signal. The long dashed line shows very roughly the dust foreground at 
143 GHz. 



We have assumed that component separation and source subtraction can be performed exactly so that the observed 
signal is only lensed CMB. Polarized thermal dust emission is expected to generate a significant magnetic signal 
at roughly the level shown by the line in Fig. ^ at 143 GHz. Separation of this signal from the CMB signal should be 
possible with multi-frequency observations, and it should then not have a significant effect on our results. 



LOSSLESS METHODS 



We now compare the above analysis with truly lossless methods. Lossless, likelihood analysis for CMB polarization 
in pixel space has been considered recently in Rcf. ||2^ . In this section we consider lossless and nearly lossless methods 
in harmonic space. 

A simple way to incorporate most of the magnetic signal for constraining the tensor modes is to use the unprojected 

-1/2, 



III. The null- hypothesis covariance matrix TV can be 



variables D^''' ^U^'B, where £)+ and C7+ were defined in Sec. 
computed including the expected signal from the electric polarization, and the analysis can be performed as before. 
This is marginally superior to using the projected variables if the tensor amplitude is quite high, as shown in Fig. ^ 
for the Planck mission. For smaller tensor amplitudes the entangled linear combinations of E and B modes are 
dominated by the electric component and performing the projection looses very little. Using the projection gives one 
clean separation, and there is no need to know the electric polarization power spectrum. By identifying variables that 
depend only on the electric and magnetic signals at the level of the map we also do not need to assume Gaussianity, 
so we could for example perform Gaussianity tests on the two physically distinct polarization types independently. 

We now consider the full joint analysis of the electric and magnetic polarization, with the pseudo-multipoles E and 
B forming our fundamental data vector 



-iW- W+ 



(51) 
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Since we are no longer worrying about E-B separation so we can equally well use the block-diagonal frame where 
Performing a singular value decomposition of the block-diagonal matrix diag(2VF, -2W) = UDU^ (where the ma- 



trices U and D should not be confused with those defined in Sec. HI), we can identify the well determined linear 
combinations as before 

^/ ^ _5-i/2^tA w £>i/2[/tA. (53) 

This diagonalization is equivalent to defining harmonic coefficients A' with respect to a complete set of spin two 
harmonics which are orthonormal over the patch of sky, in the same way as for the spin zero cut-sky temperature 
analysis ||l^, |l^. As before, this construction ensures that for isotropic noise the noise correlation is diagonal 

(AAAAt) ^ 2a^diag(2W, -2W) =^ N= (AA'AA't) = 2cr^/. (54) 
The signal correlation is given by 

/r<EE I /~iBB r<EE r<BB\ 

s = (A'A't) = r> V2c^t + ^ tjb'/\ (55) 



where the C^^ and C^^ are the diagonal electric and magnetic power spectrum matrices respectively, and we have 
assumed that C^^ = 0. If the noise and signal are Gaussian we can proceed to do a likelihood analysis for the power 
spectra using 

,.n^FF ^BB^ exphiA't(7V + 5')-iA'] ^ ^ 

i(A'|C^^, C^^) cx \N \s\y^ ■ ^^^^ 

The coupling matrices ±2W can be computed quickly for an azimuthally symmetric patch of sky, as described in 
Appendix and modes with different m decouple. The problem is therefore tractable. However it is not nearly so 
simple to find a maximum likelihood estimate of the magnetic amplitude, and in general there will be complicated 
correlations between the recovered power spectra. By using the lossy projection in the previous sections we have 
essentially shown that this likelihood function is 'nearly' separable. Making it separable costs something in terms of 
lost information, but it significantly simplifies the problem. Using the projected variables also reduces the size of the 
matrices, so performing the matrix inversions is significantly faster. 

If the B signal is determined to be negligible one would want to apply an efficient nearly-lossless method to estimate 
the electric power spectrum Cj^^ (or to do parameter estimation), so we now consider the case when one polarization 
type is absent. If the B signal can be neglected we have 



E\ (W+ 



E = UDV^^, (57) 



where we have done a singular value decomposition as before so that we can find the well determined linear combina- 
tions of the Elm ■ 

= D-^/^U « r>i/2vtE. (58) 

The matrices one has to invert to do a likelihood analysis are now one half the size of those in the optimal method 
when both polarization types are present, and so the problem is numerically much faster. However isotropic noise no 
longer gives the simple diagonal noise covariance, though this can always be rectified by using Ar~^/^xs. In practice a 
nearly optimal method would probably be more appropriate using only the unprojected variables D_^_ U\_'E, where 



and [/+ were defined in Sec. III. These variables have diagonal noise properties like the E^f for isotropic noise, 
and the computational saving may be significant when analysing high resolution polarization maps. We have checked 
numerically that including B in the analysis gains very little even for low 1^3.^ and small patches. For large area 
surveys at high resolution the information loss will probably be negligible. 

There are exactly equivalent relations for the well determined magnetic variables in the case when E vanishes. This 
case is of little practical interest, since the E signal would have to be removed to within the magnitude of the B signal, 
and this is impossible on an incomplete sky since the two are not unambiguously distinguishable without accurate 
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boundary data. However, supposing that E could be removed is useful theoretically as we can then compute the best 
obtainable magnetic signal to compare with what we obtain using our projected variables. The information lost due 
to the projection depends on the cosmology. Models with reionization have more power on large scales and a greater 
fraction of the power is lost due to removal of the boundary terms. For our toy model of the Planck satellite we find 
that the amplitude that could be detected at given significance and probability is reduced by about 30 per cent by the 
projection for a cosmology with reionization at z = 6.5, but only by 2 per cent for a zero reionization model. In the 
reionization model one is loosing a lot of the additional information in the low multipoles that in the absence of the 
projection would have high signal to noise. The net result is that the reionization model has an only slightly higher 
chance of giving a tensor detection despite having more large scale power. By using the unprojected variables and 
incorporating the expected electric polarization contamination as an extra noise term one can approximately halve 
this loss. The lossless result is compared to the realistic projected result for general circular sky patches in Fig. ^ for 
an observation with much higher sensitivity. The cost we incur by using the non-optimal method in terms of slightly 
larger error bars on the B signal, or a less powerful test of detection at a given significance, is small for large survey 
areas though it does increase for small sky patches. For these sensitive observations the electric signal is much larger 
than the magnetic signal and essentially nothing is lost by performing the projection rather than including electric 
contamination as a large extra noise. 

To make a detection of the magnetic signal on such a small sky patch with the planned long duration balloon 
observations the tensor/scalar ratio would need to be significantly larger than one, which is too large to be allowed 
by the current temperature anisotropy observations |28| . Of course, seeing if there is only a small magnetic signal is 
an important consistency check for current models with low tensor/scalar ratio to pass. 

One simple way to reduce the information loss in our method would be to use data objects that include not only 
the surface integrals I±2,w, but also those parts of the boundary terms in Eqs. and (^) that do not depend on 
3 P on the boundary. Such objects would separate electric and magnetic polarization exactly if the scalar window 
functions were constructed to vanish on the boundary. The problem of producing a non-redundant set of such 
windows could be tackled with a simple variant of the harmonic- based method presented in Sec. [IL The additional 
boundary contribution would cancel that part of W that couples to the normal derivative of the window function 
on the boundary, leaving a single non-zero singular value (for |m| > 0) to project out. The net effect would be 
that for azimuthal patches we would gain one extra variable per m for |m| > 1, though the noise properties of these 
variables would not be as simple as if the line integrals were not included, and the problem of performing line integrals 
with pixelized data is non-trivial. For reionization models (which have significant large scale power) the reduction 
in information loss may be worth the effort required to overcome these obstacles, though a full analysis with the 
non-separated variables would probably work better. 



VI. CONCLUSION 



We have considered the problem of producing statistically independent measures of the electric and magnetic 
polarization from observations covering only a portion of the sky. Although the separation of the polarization field 
into electric and magnetic modes is not unique in the presence of boundaries, we have shown how to construct window 
functions that are guaranteed to probe separately the electric and magnetic polarization exactly over azimuthally- 
symmetric patches of the sky. We presented a harmonic-based method for efficient construction of the windows that 
automatically removes redundancy due to the finite sky coverage. In addition, our window functions return separated 
electric and magnetic variables that have very simple diagonal noise correlations for idealized noise on the polarization 
map. For azimuthal patches separating the electric and magnetic polarization comes at the cost of losing two pieces of 
information per m mode, or roughly twice the number of pixels of area (Tr/Zmax)^ on the boundary of the patch. For 
large patches this information loss is small unless there is large scale power due to reionization, but for smaller patches 
it can be more severe due to the limited support of the high m spin-weight 2 harmonics in the patch. Although we 
have proved that our method gives exact separation for azimuthal patches, the harmonic-based construction should 
produce window functions that give approximate separation for arbitrarily shaped patches with similar information 
loss to the azimuthal case. 

We showed how the variables constructed from our window functions could be used to constrain the amplitude of the 
magnetic signal without contamination from the much larger electric signal. For the first time, we made predictions 
for the tensor amplitude that Planck should be able to detect taking proper account of excluding the galactic region. 
If other non-negligible foregrounds can be removed using the other frequency channels, Planck should be able to 
detect the magnetic signal predicted by some simple infiationary models. For less sensitive observations, our window 
functions should nevertheless be useful to set upper limits on the magnetic signal, and may also aid the identification 
of systematic effects in the instrument or analysis pipeline. 

If the magnetic signal is shown to be consistent with noise then we showed how one can use all the well determined 
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polarization pseudo-multipoles to analyse the electric polarization power spectrum without loss of information. The 
analysis using these variables is no more complicated than the analysis of temperature anisotropics using cut-sky 
orthogonalized scalar harmonic functions |ll| . 

We have only considered isotropic noise here, however, as long as the noise is azimutlially symmetric the separation 
of m modes will still work, and the problem remains computationally tractable though rather less simple. In practice, 
there will be several other complications in real-life CMB polarimetry observations that will impact on the map- 
making and subsequent analysis stages. Further careful investigation of the propagation of instrument effects such 
as beam asymmetries, straylight, cross-polar contamination, and pointing instabilities through the map-making stage 
will be required before the programme for analysing magnetic polarization outlined in this paper will be realizable. 
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APPENDIX A: SPIN RAISING AND LOWERING OPERATORS 



In general a spin-weight s quantity ^ry is defined over a two-dimensional Riemannian manifold with respect to an 
orthonormal diad field {ei, 62}. The local freedom in the choice of diad amounts to the transformations 

e± = ei ± ie2 e^^'e± (Al) 



of the (complex) null vectors e±. A quantity is defined to be of spin- weight s if under the transformation (Al) 
sTj sTye'*^. To every spin-weight s object sTj we can associate a (complex) symmetric trace-free, rank-|s| tensor 
Va, = ilai...a/- for s > 0, 



r?^= =2-%r;e^, (A2) 



where the irreducible tensor product = . . . e°i" . The inverse relation is 

gri = r]A,e^'\ (A3) 

For s < we define Ty'^i^i = 2^l''lsr/e^''' . The spin raising and lowering operators 9 and S are defined by the null 
diad components of the covariant derivatives of 77yi|^|: 

^±\s\V = -«V,77A,Je±i^ (A4) 
9 ±1.1'/ = -(elVe77A|Je±'^ (A5) 

(The minus signs are conventional). 

In CM B polarimetry we are concerned with fields defined over the sphere, in which case the transformation in 
Eq. ( [AlD corresponds to a left-handed rotation of the diad about the outward normal f . Choosing the orthonormal 
diad to be aligned with the coordinate basis vectors and </) of a spherical polar coordinate system, we have ej.Vaej_ = 
cot 6'ej_ and e° VqCj. = — cot 6e'^. It follows that for this choice of diad the spin raising and lowering operators reduce 
to 

3^77 = -sin'0{dg+icsc0d^){sm-'0sr]), (A6) 
3,77 = -sin-^" 9{d0 -icsc0d^){sm'' e,r]). (A7) 

An elegant interpretation of the spin raising and lowering operators on the sphere can be obtained by considering 
spin-weight s objects s'7(^, </>, "0) defined on a diad at (position-dependent) angle ^ to the coordinate directions, so 
that 

,77(^,0,^) =.ry(0,0)e'^'^, (A8) 



^ http://www.nctlib.org/lapack/ 
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where sr?(6', 0) is defined on and </>. In this case, the spin raising and lowering operators can be related to the 
angular momentum operators for a rigid body [^6| . Working in a representation where the orientation of the body is 
specified in terms of Euler angles ((/>, 9, the angular momentum operators on the body-fixed axes take the form 

K, = id^, (A9) 
K± = €^'"^{±89 + i CSC 9d^ + i cot 0d^), (AlO) 

where K± = ± iKy. These operators satisfy the commutation relations 

[K,,K±] - tA±, [K+, - -2K,, (All) 

so that K± are lowering/raising operators with respect to the eigenvalues of K^- Note that the signs in these 
commutation relations are different from those for angular momentum operators on a fixed frame since on the body- 
fixed axes we have [K^, Ky] = —iK^ | |3C| ]. The action of the spin raising and lowering operators can then be formulated 
in terms of the angular momentum operators as 

if+,77(0,0,V) = -e^(^+i)'^ 9,77(0,0), (A12) 
X_,77(0,0,V) = +e^(^-i)'^ 5,77(0,0). (A13) 

Several useful results for the spherical raising and lowering operators follow from the commutation relations ( [All| ). 
[Similar relations on a general manifold can be found from Eqs. (A4) and (A5).] For a spin- weight s quantity defined 
on the coordinate basis, 

(5 9 - 5 g),?7(0,0) = 2s,?7(0,0), (A14) 
[5 5 - s(s - 1)],77(6',0) = (V^ -s^ csc^ 6* -I- 2is cot 61 CSC 6* a^), 77(0,0), (A15) 

where we have used K^K_ = — — to derive the last identity. Applying these relations repeatedly to a 
spin- weight quantity we find 

5 5 5 5o77(0,0) = 5 5 5 5o77(^,0) = (V^ + 2)V2n77(0, 0), (A16) 

and the useful relation 

(55^- 5^5)077(^,0) -s(.s-l)5^-io?y(e,0)- (A17) 



Integral theorems 



The integral of the spin- weight quantity 9 _i7; over some portion S of the two-dimensional manifold is determined 
by the integral around the boundary dS: 



dS 9_i77 6 idZ_i7;, (A18) 

S JdS 

where idl is the spin-one element of length around the boundary: 

id; =idlael. (A19) 



On the sphere in the spherical polar coordinate frame id/ = id9 — sin0d0. Equation (A18) is the complex represen- 
tation of Stokes' Theorem and the Divergence Theorem. An equivalent result holds for spin-weight one quantities by 
forming the complex conjugate of the above, with id/ = —idlae°_. 

The spin raising and lowering operators obey Leibnitz' rule, so there is a 'Green's identity' 

pggQ-Qg9P= 9(pgg)- g(Q5F), (A2o) 



^ Our convention for Euler angles (q, /3,7) follows Ref. psj] , i.e., successive right-handed rotations by 7, /3, and a about the z, y, and 
2-axes respectively. The use of tp, which is minus the third Euler angle, as a configuration variable for the rigid body is necessary to 
relate the angular momentum operators directly to the spin raising and lowering operators with the (consistent) conventions we have 
adopted here. 
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where P and Q have definite spin weight. For PQ with spin- weight zero integrating over a surface gives the integral 
theorem 

[ dS{PddQ-QBdP)^ (f idlPdQ~(f idlQBP. (A21) 
Js Jas Jas 

A similar result is obtained using 

PddQ-QBBP^ d{PBQ-QdP) (A22) 

for PQ of spin weight —2 which gives 

/d5(P5 gg-gg gp) = / id;(pgg-ggp), (A23) 

Js Jas 
with a similar result for a spin-weight 2 quantity. 



APPENDIX B: SPIN WEIGHTED SPHERICAL HARMONICS 



The spin- weight s spherical harmonics sYim are defined in terms of the usual spherical harmonics Yim by 



^ l {l-\s\)\ 

s-^lra — 1/ /7 , I i\i ^ ^Im-) 



{l + \s\)\ 



(Bl) 



where g I*' = (— l)''gl''l. They are non-zero for \s\ < l,\m\ < I. By making use of Eqs. (A12) and (A13), and the 
properties of the K± operators when acting on Wigner D-matrices"* E^, Ell , it is straightforward to show that ||2l|] 



4tt 
21 + 1 



sYirn{0,4>)e 



isip 



(B2) 



With the conventions adopted here, gYim — (~l)''^™-s^i(-m)- Orthonormahty of the spin weight harmonics over the 
full sphere, 



(B3) 



follows from the orthogonality of the D-matrice s ove r the S0(3 ) group manifold. 

Applying K+K- to Eq. (B2), and using Eqs. (A12) and (A13), one can show that the spin weight harmonics satisfy 
the differential equation 



,Ylra ^ [sis - 1) - l{l + 1)] sYl, 



(B4) 



[An alternative proof of this result follows from Eq. ( A17 ).] The spin weighted harmonics are separable and can be 
written as 



= ,Ai™(cos0)e™^ 



The sAim satisfy the self-adjoint equation 



2, , rn^ + s'^ + 2'msx 



where a prime denotes differentiation with respect to a; = cos I 



(B5) 



(B6) 



* Our conventions for the D-matrices follow Refs. ^ij . We adopt the Condon-Shortley phase for the spherical harmonics, which differs 
from that used by Goldberg et al. pa] by a factor of (—1)'". 
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The s^im can be evaluated recursively for m > |s| starting with 
and the recursion relation (derived from standard results for the Wigner ZJ-matrices; see e.g. Ref. [ ^ ) 

sA/m — (x + -J- -T- ) C'slms^(l-l)m — 7^ sA(i_2)m, (B8) 

where 



^ / P(4F-1) 

The harmonics for m < — |s| can be obtained from s\(-m) = {—^Y'^"^ -sMm- A straightforward way to evaluate s^im 
for \m\ < \s\ is to compute ±n^i\s\ for < n < |s| and then use the symmetry s^im — {—l)^~^^m^is- Further useful 
results for the spin weight harmonics can be found in Refs. |2^, and expressions for the spin- weight ±2 harmonics 
in terms of the associated Legendre functions are given in Ref.^^. 



APPENDIX C: OVERLAP INTEGRALS 



Setting P = sYi*^, and Q — sYim in Eq. (A21), and using the differential equation (B4) to simplify the integrand 
on the left-hand side, we have 



Js 



dS 



id/ sYim 3 sY,f^,. 



(CI) 



dS 



This expresses the overlap integral for I ^ V m terms of a line integral around the boundary of S. 

For an azimuthally symmetric surface the integrals are particularly straightforward as the spin-weight harmonics 
with differ ent m are orthogonal over the patch. The overlap integral for the same m but different I can be determined 
from Eq. ^ to be 



[1(1 + 1) - I' {I' + 1)] / da; sXl'm sXlm = (1 - x'')[s\'i,„,shra - sAj^.A^mla- 



(C2) 



[This result also follows directly from Eq. (B6).] The derivatives can be removed while maintaining homogeneity in 
the spin weight by using 



/(I - X^)s\'im = -{sm + fx)s\lm + ■^^77^-^sA(/_i) 



C. 



slm 



(C3) 



to write the I' ^ I integral as 



^h" = 27r / dxsXi'msXim 



2w 



{I + I' + - I') 



\X ifT) ^ ' )sA/'msA/„ 



21' + 1 



2^ + 1 



Note that AIJ? = 



"I" s\l' -l)msXlm j:, s\l~l)msXl'm 

. The I = I' ^ \m\ integrals can be evaluated recursively using 



Asm _ Asm , C^slm .sm C slm .sm . 2sm ^ „j 



which follows from Eq. (B8). For \m\ > |s|, the starting values ^f™||„j| can be obtained from 

2tt[xsXjj^jj^]^ 



A sm 
mm 



A 



s(m — 1) 
(m— 1) (m— 1) 



" I m 1 1 m 
S 



2m + 1 

27T[XsXl^_^^t 



-.A 



s(m — 1) 



As(-m) _ As(-m+l) 

^mm ^ 2m + 1 



V(2m+l)(m2-s2) m(m-ir 

:A1(7"+V (m > \s\), 



y/{2m+ l)(m2 



^(m — 1) 



(C4) 

(C5) 

(C6) 
(C7) 
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which follow from the explicit form of 
solutions 



given in Eq. (JBT]). For I' = I = m. 



the integrals have the analytic 



2m+l 



2m+l 



(C8) 



For |m| < \s\, the starting values Aj^Jjl^jfor the recursion in Eq. (C5) can be obtained from the symmetry AfJT = A^,'' 
and the recursion relations ( |C6| ) and (C7). 

If one is also generati ng t he spin-weight zero overlap integrals for analysing the temperature field an alternative 
approach is to use Eq. ( A23| ) to relate the integrals of the spin- weight two and zero harmonics. In general integrals 
for different spin weights can be related by 



ki(s-i)ku 



dS 



dS 



idl (fc;/(<;„i) sF/^s-lll'm' + kls s-2Yl'm' s~lYi^) 



where kis = + 1) — s{s — 1), and we have used the results 

^ sYlrn — ^i(s+l) s+lYlr, 
^ sYlm — kls s — lYlra 



(C9) 



(CIO) 
(Cll) 



which follow from Eq. (Bl). For s = and s = 2 one can obtain the spin ±2 integrals in terms of the spin zero 
integrals. The spin zero integrals are computed using the above relations with s = 0, in agreement with the relations 
given in Ref. |32|| . 

For a small patch of sky a large number of the overlap integrals are going to be very close to zero. This makes sense 
intuitively, and is easy to see more quantitatively. From the differential equation (B6) the character of the harmonics 
changes from oscillatory to decaying at the point where 



2msx 



1 



1(1 + 1). 



(C12) 



For X nearer the poles than the critical value the harmonics become very small. For a sky patch extending from the 
north pole to x, for large I we see that for larger than « P{1 — x'^) the harmonics will be small within the 
patch. Hence the overlap integrals with > min(Z^, I' )(1 — x^) will be small, corresponding to one (or both) of the 
harmonics being localized out of the sky patch. 



APPENDIX D: STATISTICS OF WEAK SIGNAL DETECTION 



After obtaining a vector of observed data 'Bw two questions one might ask are: (1) What is the probability 
that the signal is just noise?; (2) Given the signal is not noise, what can we say about the amplitude r? The 



likelihood function (48) encapsulates all the information in the data concerning the amplitude r, and gives the 
posterior probability distribution of r after multiplying by the prior. Question (1) is really asking for a comparison 
of two models, one in which r = (or r < e where e is small), and one in which r > 0. Given there is a prior 
probability p that the signal is pure noise (r = 0) and probability 1 — p that r > 0, distributed with the normalized 
prior probability distribution /(r), the posterior probability that r = is given by 

Pir = o\B^)= n^"T"'u-d rm i ^H V ^^^^ 

pL{Bw\r = 0) + (1 - p) drL{Bw\r)f{r) 

The posterior probability tells us the probability that the signal is pure noise once we have a particular set of data. 
However, it doesn't immediately tell us that we might expect to obtain from a given observation, just what we know 
once the observation has been performed. The posterior distribution can depend strongly on the prior. 
The posterior probability is a monotonic function of the Bayes factor 

/-d.L(B^|r)/(.) 
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In classical hypothesis testing the likelihood ratio L(Bvy |r')/L(Bvi/ |r = 0) is the most powerful test statistic for 
distinguishing a model with r = r' from one with r — Q (for details see Ref. |Q ) . The largest fraction 1 — a of the 
values of the likelihood ratio under the null-hypothesis (r — 0) determine a region, which, if the observed ratio falls in 
it, rejects the null hypothesis at significance a. The distribution of the likelihood ratio is straightforward to compute 
using Monte-Carlo techniques. In general we do not have some fixed alternative hypothesis r — r' \ one possibility is 
to use the the statistic t, formed by marginalizing over some prior, in place of the likelihood ratio. Often the prior is 
fixed to 5{r — f), where f is the maximum likelihood estimate of r, so the test reduces to a likelihood ratio test with 
the alternative hypothesis r = r. 

By the time magnetic polarization comes to be observed we should have some prior information about the tensor 
amplitude from accurate measurement of the temperature power spectrum. However at this point we do not have 
very useful information about the prior distribution, and to consider the possible results of future experiments it is 
useful to assume no information. The maximum entropy prior (the uniform prior) is probably not appropriate in this 
case — do we really think that all values of r (below a certain bound) are equally likely rather than, say, all values 
of Inr being equally likely? The uniform prior gives radically different answers depending on the choice of variable. 
The Jeffreys prior 

I pp. \ 1/2 

/(r)cx /-— lnL(r|B^)\ ^ \\,x{{N + vSy^ S{N + rSy^ S\^'^ , (D3) 

is reparameterization invariant [^^ , and therefore does not suffer from this problem. The Jeffreys prior goes like 
1/(1-1-7') if A/" = S and in general is improper (does not have a finite integral), though this is not a problem for 
evaluating test statistics since the integral of the product with the likelihood function will be finite. It is also not 
a true prior in the sense that it depends on what data is going to be collected, but this is really a good thing as it 
concentrates the prior probability in the region where we need it in order to obtain a detection. Since the prior is not 
localized the posterior probability of the null-hypothesis will depend on where one cuts off the prior. However if one 
only ever compares the Bayes factors t the cut-off is not very important since the likelihood function will be localized. 
For this reason performing classical hypothesis tests using t is rather more independent of the prior information than 
considering the values of the posterior probability, though clearly it is the latter which is rigorous and contains all the 
available information. 

Classical hypothesis tests are useful for assessing probability of getting a detection at a given significance, though it 
should be remembered that getting a detection at 99 per cent confidence does not mean a the probability of 1 per cent 
that the signal is pure noise. However for sensible priors there will be a close correspondence; classical and Bayesian 
techniques agree that high values of t correspond to the null-hypothesis being less likely. Using classical hypothesis 
tests to compute the probability of getting a detection at a given significance for a given true r is significantly simpler 
than computing and interpreting the corresponding distribution of the posterior probability distributions. 

For full sky surveys there are a large number of statistically-independent magnetic variables, which can be thought 
of as the eigenvectors of rN~^l'^ SN~^I'^ . Those variables corresponding to large eigenvalues have high expected 
signal to noise and are most useful for obtaining detections. If has dimension n there will be n independent 
variables. If the eigenvalues of rN~^l'^ SN~^I'^ are distributed fairly uniformly, and we consider expected chi-squared 
detections of order one sigma, all the eigenvalues will be ^ '^l\fn and we have rS ^ TV. Using a second order 
approximation we then have 

(iV rS)-^ w (7 - rN-^S^ N-'^ SN-'^ S)N-\ (D4) 

and 

ln|Ar + rS'| = ln|iV| +ln|/ + rAr-iS'| « lrL\N\ + tr{rN-^ S - ^r^ N''^ SN''^ S). (D5) 

In this approximation the entire likelihood distribution is simple to compute since the matrix manipulations only need 
be performed once, and is of the Gaussian form 

L{Bw\r) = L{Bw\r = 0)e^" /^^"e-^"-^^" (D6) 

where 

l/cr^ ^ bI^N-^SN-^SN-^Bw - ^ti{N-^SN-^S), (D7) 
and the maximum likelihood estimate for r is 

f « ifj^ \b\^N-^SN-^Bw - tr(iV-iS')] . (D8) 
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The maximum likelihood estimate for r is only weakly biased for small signal to noise. 

Consider fixing the signal hypothesis to r = f , which gives the largest likelihood ratio possible for any prior. The 
likelihood ratio is then a monotonic function of the test statistic 

, ^, BlN-^SN-^Bw-tv{N-^S) 
V =r/a= — . (J-*") 

^JA'B^^N-^SN-^SN-^^w - 2tr{N-^SN-^S) 

The quantity f/a gives the number of standard deviations the maximum likelihood is from pure noise, which is an 
good intuitive measure of the number of 'sigmas' at which the magnetic signal has been detected. If any eigenvalues 
become large enough then the small r approximation will fail and one needs to compute the full likelihood ratio to 
obtain optimal results. The statistic is a function only of the vector N'^^^Bw and the matrix N^^^^SN^^/"^ , so 
working in the frame in which the matrix is diagonal the statistic is simple to compute. Note that and f can be 
negative and correspond to a non-detection. 

The second order approximation can also be used to speed up computing the full likelihood function for integration 
against a prior. In the diagonal frame it is straightforward to identify any modes with high signal to noise — if there 
are some then the likelihood for these modes can be computed exactly. The likelihood for the remaining low signal to 
noise modes can be computed quickly using the approximation in which the likelihood is Gaussian. Multiplying these 
together allows the full likelihood function to be computed, and hence the the probability distribution of t. Whilst 
this is slower than using the v' statistic it is a useful check, and may be essential when there are very high signal to 
noise modes. 

For good detections the prior is not very important as long as it is not small over the bulk of the likelihood. If 
the prior is approximately constant over most of the likelihood integral, the second order approximation is valid, and 
f/(T ^ 0, then t ~ e'' /^"^ g(f, a) where g(f, a) depends weakly on f and a compared to the exponential. In this case 
t depends almost entirely (and monotonically) on the value oi i/' = f/a, which is why ly' is a good statistic to use. 
We found that detection probabilities computed using full likelihood results with the Jeffreys prior tend to agree very 
closely with those computed using v' . 

The advantage of using v' rather than computing the full likelihood distribution is that it is computationally 
significantly simpler and faster, which is useful, though by no means essential, for performing accurate Monte-Carlo 
computations. There is no problem computing using the full likelihood distribution exactly from single samples of 
actual observed data. 



Busting the null-buster 

In the limit that r ^ 1 the likelihood function can be approximated by the first two terms in its Taylor series in r. 
The likelihood ratio for any given (small) r is then a monotonic function of dr lnL(Bvi/|'')lr=o' which is independent 
of r and therefore provides the uniformly most powerful test. For Gaussian signal and noise it is proportional to 
2f/tT^. After dividing by the root of the variance in the null hypothesis ((2f/cr^)^)|r=o = 2tr(N^^SN^^S) we obtain 
the quantity 

y/2ti{N-^SN-^S) 

which is the optimal^ quadratic 'null-buster' statistic introduced to the CMB literature in Ref. j35|. In the limit in 
which all the eigenvalues of rN^^^^ SN~^^^ tend to zero this is the optimal test statistic. In general it is not — a 
signal that can be detected will violate this assumption. For a very large number of approximately equal eigenvalues 
the value of a will approximate (1/ct^)~^/^ = [^tT{N~'^ SN"-^ S)]~^^'^ , the Fisher curvature for small r. In this limit 
the null-buster remains a good statistic and v' /v 1. However in general v' performs significantly better when the 
eigenvalues are distributed less evenly, or when there are not that many eigenvalues. As the signal to noise increases 
v' also performs much less sub-optimally than the null-buster. 

For the case of magnetic polarization observations the u' statistic outperforms the null-buster for a wide range 
of patch sizes in realistic reionization models. The large scale magnetic signal coming from low redshift (z < 10) 



It is the quadratic statistic which gives the maximal expected detection in units of the expected standard deviation under the null 
hypothesis. None of these properties are required or even especially desired (we are more interested in getting detections at high 
significance with high probability). 
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reionization gives a small number of modes with relatively high signal to noise (see Fig. |j) , and the conditions under 
which the null-buster is a good statistic are therefore not satisfied. The qualitative reason that the null-buster performs 
poorly is that the position of the maximum and the curvature of the likelihood function are correlated, so dividing 
the actual maximum by the expected curvature does not give you an accurate measure of the number of 'sigmas' from 
zero for a particular observation. This makes the null hypothesis distribution unnecessarily broad at large values, and 
therefore makes it harder to rule out the null hypothesis with good significance. The v' statistic has a much sharper 
distribution than the null-buster (which has a distribution similar to chi-squared) in the alternative hypothesis, and 
the value of v' corresponds much more closely to the significance (measured in Gaussian-like 'sigmas') of the result. 

A few points can be made in the null-buster's defence. Firstly it is slightly easier to compute than v' . Secondly, 
since we motivated the v' by assuming a Gaussian signal it is conceivable that the null-buster could perform better 
with certain non-Gaussian signals. Lastly, the null-buster is quadratic which makes it easy to calculate the mean and 
variance analytically. However it is clear that with Gaussian signals using the null-buster is in general significantly 
sub-optimal. 
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